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IN THE CLAIMS : 

1 . (Currently Amended) A method of performing automatic speech in a variable 
background noise environment the method comprising the steps of: 

processing a first portion of an aud i o signal an inter-sentence pause t o obtain a 
first characterization of the first portion of the audio c l onal inter-sentence pause : 

comparing the first characterization to a set of reference non-speech audio 
characterizations to determine a particular r e f e r e nc e non-speech audio_ characterization 
among the set of fefofeftse non-speech audio characterizations that most closely 
matches the first characterization; 

generating an updated set of non-speech characterizations by updating the 
particular rof e r e nc e non-speech audio c haracterization so that the particular roforo n oo 
non-speech audio characterization more closely resembles the first characterization. 

2. (Currently Amended) The method according to claim 1 further comprising the step of: 

detecting an the inter-sentence pauser-and 

i n r e spon se to th o c t o p of detecting, p e rform i ng th e-s t e p -erf- pr - ooossing th e fir s t 
por tion - of - tho - audio signa l wh o roin tho first port i on of th e audio s ignal ic i nclud e d i n th e 
i n t or c ontonG o pau se. 

3 r (Currently amended) The method according to claim 2 1 wherein: 

the step of processing the first portion of the audio s igna l inter-sentence pause t o 

obtain a first characterization includes a sub-step of: 

processing the first portion of the audio signal inter-sentence pause t o obtain a 

first set of numbers that characterize the first portion of the au tf i o - Gignol inter-sentence 

pause : and 

the step of comparing the first characterization to a set of r e f e renc e non-soeech 
audio characterizations comprises the sub-steps of: 

comparing the first set numbers to a plurality of r e f e rence non-speech audio sets 
of numbers to determining a particular set of reforonco non-soeech audio numbers that 
most closely matches the first set of numbers. 
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4. (Currently amended) The method according to claim 3 wherein the step of updating the 
ffefefenee non-speech audio_ characterization comprises the sub-steps of: 

replacing each number in the particular set of numbers with a weighted average 
of the number and a corresponding number in the first set of numbers. 

5. (Currently amended) The method according to claim 4 wherein the step of comparing 
the first characterization to a set of r o f oron ce non-speech audio c haracterizations comprises the 
sub-steps of; 

taking a dot product between the first set of numbers and each of the plurality of 
r e f e r e n ce non-speech audio sets of numbers. 

6. (Cancelled) 

7. (Currently amended) The method according to claim § 5 wherein 

the plurality of refer e nc e non-speech audio sets of numbers are means of components 
of Gaussian mixtures that characterize the probability of an underlying state of a hidden 
mMartov model of the audio signal, given the firet set of numbers. 

8. (Currently amended) The method according to claim 7 wherein the step of processing 
the first portion of the audio - s i gna l inter-sentence pause t o obtain the first characterization of the 
first portion of the audio cigna l inter-sentence pause c omprises the sub-steps of: 

a) time domain sampling the audio s i gna l inter-sentence pause t o obtain a 
discretized representation of the au d i o - signa l inter-sentence pause t hat 
includes a sequence of samples; 

b) time domain filtering the sequence of samples to obtain a filtered 
sequence of samples; 

c) applying a window function to successive subsets of the filtered sequence 
of samples to obtain a sequence of frames of windowed filtered samples; 

d) transforming each of the frames of windowed filtered samples to a 
frequency domain to obtain a plurality of frequency components; 

e) taking a plurality of weighted sums of the plurality of frequency 
components to obtain a plurality of bandpass filtered outputs; 

f) taking the log of the magnitude of each of the bandpass filtered outputs to 
obtain a plurality of log magnitude bandpass filtered outputs; and 
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g) transforming the plurality of log magnitude bandpass filtered outputs to a 
time domain to obtain at least a subset of the first set of numbers. 

9, (Currently amended) The method according to claim 8 wherein the step of processing 
the first portion of the aud i o signa l inter-sentence pause t o obtain the first characterization of the 
first portion of the audio s ignal inter-sentence pause f urther comprises the sub-steps of: 

repeating sub-steps (a) through (g) for two portions obtained from at least one o f 
the-attd io - oignal inter-sentence pause t o obtain two sets of numbers; and 

taking the difference between corresponding numbers in the two sets of 
numbers to obtain at least a subset of the first set of numbers. 

10. (Currently Amended) An automated speech recognition system comprising: 

an audio signal input for inputting an audio signal that includes speech and 
background sounds; 

a feature extractor coupled to the audio signal input for receiving the audio 
signal and outputting characterizations of a sequence of segments of the audio signal; 

a model coupled to the feature extractor, wherein the model includes a plurality 
of states to which characterization of the sequence of segments are applied for 
evaluating a posteriori probabilities that one or more of the plurality of states occurred; 

a search engine coupled to model for finding one or more high probability 
sequences of the plurality of states of the model; 

a detector for detecting a - opocifio ctato a n absence of speech sounds of the 
audio signal and outputting a predetermined signal when the absence of speech sounds 
is detected; and 

a comparer and updater coupled to the detector for receiving the predetermined 
signal and in response thereto apd at i ng - th e modol c o t hat - it - mor e clos e ly modolG one or 
moro charact o r i aations output by tho foatu r o extractor that oorr es p oft d - to th e specific 
state- determines a mean of a multi component Gaussian mixture associated with 
background sounds that is closest to a feature vector that characterizes the audio signal 
during the absence of speech sounds, and updates the mean so that the mean is closer 
to the feature vector that characterizes the audio signal during the absence of speech 
sounds . 
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1 1 . (Currently Amended) The automated speech recognition system according to claim 10 
wherein: 

the feature extractor outputs characterizations for each of a succession of frames 
that include feature vectors that include cepstral coefficients; 

the model comprises a hidden markov model that includes a plurality of emitting 
states and multi component Gaussian mixtures that give the a posteriori probability that 
a given feature vector is attributable to a given emitting state; 

the detector detects aft the_absence of speech sounds by comparing a function of 
one or more cepstral coefficients to a threshold; and 

tho compa r or and updator dotorminos o m e an of a - rnatti - oomponont Gaucoian 
m i xture associat e d -wi tMBaokground sound &- t h at - is - c l oooct to a f o a t upe v e ctor that 
s tatfacteriz e s th e aud i o -sigFia l dur i ng tho abconco of epoooh cound o , and updat e s the 
m ea n - ea - that - ft -i s clos e r to th o fo at uro - v e otor that oha rao tor i zes - the aud i o s ign aturing 
tho abconc o- of spe e ch sound s. 

1 2. (Currently Amended) An automated speech recognition system comprising: 

an audio input for inputting an audio signal; 

an analog to digital converter coupled to the audio input for sampling the audio 
signal and outputting a discretized audio signal; and 

a microprocessor coupled to the analog to digital converter for receiving the 
discretized audio signal and executing a program for performing automated speech 
recognition, the program comprising programming instructions for: 

detecting an inter-sentence pause of an audio signal; 

processing a first portion of an - audio signa l t he inter-sentence pause t o obtain a 
first characterization of the first portion of the audio s i gna l inter-sentence pause ; 

comparing the first characterization to a set of r e f e r e nc e non-speech audio 
characterizations to determine a particular r e f e r e nc e non-speech audio c haracterization 
among the set of r e f e r e nc e non-speech audio c haracterizations that most closely 
matches the first characterization; and 

updating the particular ref e r e nc e non-speech audio characterization so that the 
particular r e f e r e nc e non-speech audio characterization more closely resembles the first 
characterization. 
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1 3. (Currently Amended) A computer readable medium storing programming instructions for 
performing automatic speech recognition in a variable background noise environment, including 
programming instructions for: 

detecting a plurality of inter-sentence pauses of an audio signal: 

processing a first portion of an audio ciana l- a first inter-sentence pause t o obtain 
a first characterization of the first portion of the aud i o oianol first inter-sentence pause : 

comparing the first characterization to a set of referenc e non-speech audio 
characterizations to determine a particular refefeftee non-speech audio c haracterization 
among the set of roforo nse non-speech audio characterizations that most closely 
matches the first characterization; aiid 

updating the particular roforonco non-speech audio s o that the particular 
refer e nc e non-speech audio characterization more closely resembles the first 
characterization; 

processing one or more additional portions of the audio si gnal plurality of inter- 
sentence pauses t o obtain a-ene ofle or more additional characterizations that 
characterize the one or more additional portions of the audio signa l plurality of inter- 
sentence pauses : 

comparing the one or more additional characterizations to the set of reference 
characterization to find reference characterizations among the set of r e f e r e noo non- 
speech audio characterizations that most closely matches the one or more additional 
characterizations. 

14. (Canceled) 

1 5. (Currently Amended) The computer readable medium according to claim 44 13 wherein: 

the programming instructions for processing the first portion of the audio signa l 
first inter-sentence pause to obtain a first characterization include programming 
instructions for: 

processing the first portion of the aud i o signal first inter-sentence pause to obtain 
a first set of numbers that characterize the first portion of the audio - signa l first inter- 
sentence pause : and 
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the programming instructions for comparing the first characterization to a set of 
fsfsreBGe non-speech audio characterizations comprises the programming instructions 
for: 

comparing the first set numbers to a plurality of ref e ren ce non-speech audio sets 
of numbers to determining a particular set of r e f e r e nc e non-speech audio numbers that 
most closely matches the first set of numbers. 

16. (Currently Amended) The computer readable medium according to claim 15 wherein the 
programming instructions for updating the roforono e non-speech audio characterization 
comprise programming instructions for 

replacing each number in the particular set of numbers with a weighted average 
of the number and a corresponding number in the first set of numbers. 

17. (Currently Amended) The computer readable medium according to claim 16 wherein the 
programming instructions for comparing the first characterization to a set of f oforoftoo non- 
speech audio characterizations comprise programming instructions for: 

taking a dot product between the first set of numbers and each of the plurality of 
roforonoo non-speech audio s ets of numbers. 

18. (Canceled) 

1 9. (Currently Amended) The computer readable medium according to claim 1 8 wherein: 

the plurality of r- oforo n oe non-speech audio s ets of numbers are means of 
components of Gaussian mixtures that characterize the probability of an underlying state 
of a hidden markov model of the audio signal, given the first set of numbers. 

20. (Currently Amended) The computer readable medium according to claim 19 wherein the . 
programming instructions for processing the first portion of the audio - signal first inter-sentence 
pause to obtain the first characterization of the first portion of the audio s igna l first inter- 
sentence pause comprise the programming instructions for 

a) time domain sampling the a tf d fo - s i gna l first inter-sentence pause to obtain a 
discretized representation of the audio signal that includes a sequence of 
samples; 
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b) time domain filtering the sequence of samples to obtain a filtered sequence of 
samples; 

c) applying a window function to successive subsets of the filtered sequence of 
samples to obtain a sequence of frames of windowed filtered samples; 

d) transforming each of the frames of windowed filtered samples to a frequency 
domain to obtain a plurality of frequency components; 

e) taking a plurality of weighted sums of the plurality of frequency components 
to obtain a plurality of bandpass filtered outputs; 

f) taking the log of the magnitude of each of the bandpass filtered outputs to 
obtain a plurality of tog magnitude bandpass filtered outputs; and 

g) transforming the plurality of log magnitude bandpass filtered outputs to a time 
domain to obtain at least a subset of the first set of numbers. 

21 . (Currently Amended) The computer readable medium according to claim 20 wherein the 
programming instructions for processing the first portion of the audio s i gnal first inter-sentence 
pause to obtain the first characterization of the first portion of the audie-siettal first inter- 
sentence pause further comprises programming instructions for: 

applying programming instructions (a) through (g) to two portions of the audi© ' 
signa lfirst inter-sentence pause to obtain two sets of numbers; and 

taking the difference between corresponding numbers in the two sets of 
numbers to obtain at least a subset of the first set of numbers. 
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